Classifying Web pages employing a probabilistic neural network

Authors

  • Ioannis Anagnostopoulos
  • Christos Anagnostopoulos
  • Vassilis Loumos
  • Eleftherios Kayafas
Abstract

This paper proposes a system capable of identifying and categorising web pages on the basis of information filtering. The system is a three-layer Probabilistic Neural Network (PNN) with biases and radial basis neurons in the middle layer and competitive neurons in the output layer. The domain of study is e-commerce. Thus, the PNN aims to identify e-commerce web pages and classify them into the respective type according to a framework that describes the fundamental phases of commercial transactions on the web. The system was tested with many types of web pages, demonstrating the robustness of the method, since no restrictions were imposed except for the language of the content, which is English. The probabilistic classifier was used for estimating the population of specific e-commerce web pages. Potential applications involve surveying web activity on commercial servers, as well as web page classification in rapidly expanding information areas such as e-government or news and media.
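The layer structure named in the abstract (radial basis neurons in the middle layer, competitive neurons in the output layer) can be illustrated with a minimal sketch. The abstract does not describe the page features, the training data, or the smoothing parameter, so everything below is an assumption for illustration only: the function name `pnn_classify`, the shared Gaussian width `sigma`, the two-dimensional toy feature vectors, and the integer class labels are all hypothetical and are not the authors' implementation.

```python
import numpy as np


def pnn_classify(train_x, train_y, query_x, sigma=0.5):
    """Label each row of query_x with a minimal probabilistic neural network.

    Hypothetical sketch: sigma is an assumed shared Gaussian width,
    not a value taken from the paper.
    """
    train_x = np.asarray(train_x, dtype=float)
    train_y = np.asarray(train_y)
    query_x = np.atleast_2d(np.asarray(query_x, dtype=float))
    classes = np.unique(train_y)

    # Middle (radial basis) layer: one Gaussian unit per training example.
    diff = query_x[:, None, :] - train_x[None, :, :]
    sq_dist = np.sum(diff ** 2, axis=2)
    activations = np.exp(-sq_dist / (2.0 * sigma ** 2))

    # Per-class average of the unit activations, which is a Parzen-window
    # density estimate up to a constant factor.
    scores = np.stack(
        [activations[:, train_y == c].mean(axis=1) for c in classes],
        axis=1,
    )

    # Competitive output layer: the class with the largest score wins.
    return classes[np.argmax(scores, axis=1)]


# Toy, made-up feature vectors with two hypothetical page classes (0 and 1).
toy_x = [[0.0, 0.0], [0.1, 0.2], [1.0, 1.0], [0.9, 1.1]]
toy_y = [0, 0, 1, 1]
predicted = pnn_classify(toy_x, toy_y, [[0.05, 0.1], [1.0, 0.9]])
print(predicted)
```

In this sketch there is no iterative training: the training examples themselves act as the middle-layer units, so the only free parameter is the assumed width `sigma`.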

Similar resources

An Intelligent Information System for Detecting Web Commerce Transactions

This paper proposes an algorithm for detecting web transactions through web page classification. The algorithm is implemented over a generalised regression neural network and detects e-commerce pages, classifying them into the respective transaction phase according to a framework that describes the fundamental phases of commercial transactions on the web. Many types of web pages were used in ord...

A Technique for Improving Web Mining using Enhanced Genetic Algorithm

The World Wide Web is growing at a very fast pace and makes a lot of information available to the public. Search engines use conventional methods to retrieve information on the Web; however, their search results can still be refined, and their accuracy is not high enough. One method for web mining is evolutionary algorithms, which search according to the user interests...

Designing of a New Transformer Ground Differential Relay Based on Probabilistic Neural Network

The low-impedance transformer ground differential relay is a part of the power transformer protection system that is employed for detecting internal earth faults. It is a fast and sensitive relay, but during some external faults and inrush current conditions it may maloperate due to current transformer (CT) saturation. In this paper, a new intelligent transformer ground differentia...

Anomaly Detection on the Web through Building Access Usage Profiles

Due to the increase in cyber-attacks, the need for web server attack detection techniques has drawn attention. Unfortunately, many available security solutions are inefficient in identifying web-based attacks. The main aim of this study is to detect abnormal web navigation based on web usage profiles. In this paper, comparing scrolling behavior of a normal user with an attacker, and simu...

Non-linear Least Squares Features Transformation for Improving the Performance of Probabilistic Neural Networks in Classifying Human Brain Tumors on MRI

The aim of the present study was to design, implement, and evaluate a software system for discriminating between metastases, meningiomas, and gliomas on MRI. The proposed classifier is a modified probabilistic neural network (PNN), incorporating a second degree least squares features transformation (LSFT) into the PNN classifier. Thirty-six textural features were extracted from each one of 75 T...

Journal:
  • IEE Proceedings - Software

Volume 151

Publication date: 2004